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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: LI, ET AL. 



(ii) TITLE OF INVENTION: Connective Tissue Growth Factor-2 



(iii) NUMBER OF SEQUENCES: 2 



(iv) CORRESPONDENCE ADDRESS: 



/ 



(A) ADDRESSEE: CARELLA, BYRNE, BAIN, GILFILLAN, 

CECCHI / STEWART & OLSTEIN 

(B) STREET: 6 BECKER FA^RM ROAD 

(C) CITY: ROSELAND / 

(D) STATE: NEW JERSEY 

(E) COUNTRY: USA 

(F) ZIP: 07068 



(v) 



COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: A. 5 INCH DISKETTE 

(B) COMPUTER: IBM PS/2 

(C) OPERATING SYSTEM: MS-DOS 

(D) SOFTWARE: /WORD PERFECT 5.1 



(vi) 



CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: Concurrently 

(C) CLASSIFICATION: 



(vii) PRIOR APPLICATION DATA 

(A) APPl/lCATION NUMBER: PCT/US94/07736 

(B) FILING DATE: 12 JUL 94 
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(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: FERRARO, GREGORY D. 

(B) REGISTRATION NUMBER: 36,134 

(C) IlEFERENCE/DOCKET NUMBER: 32580(^-317 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 201-994-1700 

(B) TELEFAX: 201-994-1744 

(2) INFORMATION FOR SEQ ID NO:l: 

(i)- SEQUENCE CHARACTERISTICS 

(A) LENGTH: 1128 BASE PA^RS 

(B) TYPE: NUCLEIC ACID ; 

(C) STRANDEDNESS : SINg 

(D) TOPOLOGY: LINEAR/ 



(ii) 



MOLIICULE TYPE: CD! 



(xi) 



SEQUENCE DESCRIPTION: SEQ ID NO:l: 



ATGAGCTCCC 
GTGGGGCTCT 
CCGGGAGTCG 
AACGAGGACT 
GGCGCCAGCT 
GAATATAACT 
TGCACATGTA 
CCCCCCAACT 
GAGTGGGTCT 
GGCAAGGGGC 
GCAGTTGGAA 
CTATACAACC 
TCAAAGACCT 
CTTGTGAAAG 
CTGAAAAAGG 
TACGCTGGAT 



GAATCGTCAG 
CCACCTGCCC 
GGCTCGTCCG 
GCAGJAAAAC 
CCACCGCTCT 
CCAGJlATCTA 
TTGGJlTGGCG 
TGGGCTGTC 
GTGAO 
TGGGATT; 
AAGGZAJSCTC 
CTTT. 

AACTGG 
[CCCGGAT 
GlCAflGAAATG 
[TTIGAGTGT 




GGAGCTCGCC 
CGCTGACTGC 
GGACGGCTGC 
GCAGCCCTGC 
tfC 
.CGGG 
ICGGGGGGCT 
CAACCCTCGG 
TAGTATCAAG 
TGCCTCCGAG 
ACTGAAGCGG 
CCAGAAATGT 
TATCTCCACA 
TTGTGAGGTG 
CAGCAAGACC 
GAAGAAATAC 



TTAGTCGTCA 
CACTGCCCCC 
GGCTGTTGTA 
GACCACACCA 
TGCAGAGCTC 
GAAAGTTTCC 
TGCATTCCTC 
CTGGTCAAAG 
GACCCCATGG 
GTGGAGTTGA 
CTCCCTGTTT 
ATTGTTCAAA 
CGAGTTACCA 
CGGCCTTGTG 
AAGAAATCCC 
CGGCCCAAGT 



CCCTTCTCCA 

TGGAGGCGCC 

AGGTCTGCGC 

AGGGGCTGGA 

AGTCAGAGGG 

AGCCCAACTG 

TGTGTCCCCA 

TTACCGGGCA 

AGGACCAGGA 

CGAGAAACAA 

TTGGAATGGA 

CAACTTCATG 

ATGACAACCC 

GACAGCCAGT 

CCGAACCAGT 

ACTGCGGTTC 



CTTGACCAGG 

CAAGTGCGCG 

CAAGCAGCTC 

ATGCAACTTC 

CAGACCCTGT 

TAAACATCAG 

AGAACTATCT 

GTGCTGCGAG 

CGGCCTCCTT 

TGAATTGATT 

GCCTCGCATC 

GTCCCAGTGC 

TGAGTGCCGC 

GTACAGCAGC 

CAGGTTTACT 

CTGCGTGGAC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
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(I 



GGCCGATGCT GCACGC'ZCCA GCTGACCAGG ACTGTGAAGA TGCGGTTCCC OTGCGAAGAT 
GGGGAGACAT TTTCCAAGAA CGTCATGATG ATCCAGTCCT CCAAATGCAA ^TACAACTGC 
CCGCATGCCA ARGAAG IAGC GTTTCCCTTC TACAGGCTGT TCCAATGA 



1020 
1080 
1128 



(2) INFORMATION FOR SEQ ID NO: 2: 
(i) SEQU13NCE CHARACTERISTICS 

(A) LENGTH: 375 AMINO ACIDS 

(B) TYPE: AMINO ACID 

(C) STRANDEDNESS : 

(D) TOPOLOGY: LINEAR 



(ii) 



MOLECULE TYPE: PROTEIN 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 



Met Ser Ser Ai g He Val Arg Glu Leu Ala Leu Val Val Thr Leu 

-20 / -15 " 10 

Leu His Leu Tlir Arg Val Gly Leil Ser Thr Cys Pro Ala Asp Cys 

- 5 /l 5 

His Cys Pro Lem Glu Ala Pro Lys Cys Ala Pro Gly Val Gly Leu 

:.0 / 15 20 

Val Arg Asp Gly Cys Gly Cyfc Cys Lys Val Cys Ala Lys Gin Leu 

:>5 7 30 35 

Asn Glu Asp C/b Arg Lys Thr Gin Pro Cys Asp His Thr Lys Gly 

40 / 45 ' 50 

Leu Glu Cys Asn Phe GLy Ala Ser Ser Thr Ala Leu Lys Gly He 

55 / 60 65 

Cys Arg Ala Gin Ser /Glu Gly Arg Pro Cys Glu Tyr Asn Ser Arg 

70 / 75 80 

lie Tyr Gin Asn Gly Glu Ser Phe Gin Pro Asn Cys Lys His Gin 

85 / 90 95 

Cys Thr Cys Ile/Gly Thp Arg Arg Gly Ala Cys He Pro Leu Cys 

1.0/ 105 HO 

Pro Gin Glu Leu Ser Leu Pro Asn Leu Gly Cys Pro Asn Pro Arg 
.15 120 125 
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Gli/ Glu Trp Val Cys Asp 

140 

ip Gin Asp Gly Leu Leu 
155 

Glu Val Glu Leu Thr Arg 

170 

Gly Ser Ser Leu Lys Arg 

185 

lie Leu Tyr Asn Pro Leu 

200 

Thr Ser Trp Ser Gin Cys 

215 

Thr Arg Val Thr Asn Asp 

230 

Thr Arg lie Cys Gly Val 

245 

Ser Leu Lys Lys Gly Lys 

260 

Glu Pro Val Arg Phe Thr 

275 

Tyr Arg Pro Lys Tyr Cys 

290 

Thr Pro Gin Leu Thr Arg 

305 ... 

Asp Gly Glu Thr Phe Ser 

320 

Lys Cys Asn Tyr Asn Cys 

335 

Phe Tyr Arg Leu Phe Gin 

350 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 



APPLICANT : LI , HAODONG 

ADAMS, MARK D 




(ii) TIT\E OF INVENTION: CONNECTIVE TISSUE GROWTH FACTOR-2 
(iii) NUMBER. OF SEQUENCES: 8 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: HUMAN GENOME SCIENCES, INC. 

(B) STREET: 9410 KEY WEST AVENUE 

(C) CITY: ftOCKVILLE 

(D) STATE: MD 

(E) COUNTRY :\US 

(F) ZIP: 2085C 

(v) COMPUTER READABLE^ FORM : 

(A) MEDIUM TYPE: Vloppy disk 

(B) COMPUTER: IBM ^C compatible 

(C) OPERATING SYSTEId: PC-DOS/MS-DOS 

(D) SOFTWARE: Patent\n Release #1 . 0 , ' Version #1.30 



(vi) CURRENT APPLICATION 

(A) APPLICATION NUMBER :\ 0#/4 59, 101 

(B) FILING DATE: 02-^UN; 

(C) CLASSIFICATION: 

(viii) ATTORNEY /AGENT INFC 

(A) NAME: A. ANDERS BROOKES 

(B) REGISTRATION NUMBER: 36,3^3 

(C) REFERENCE /DOCKET NUMBER: P5;126P1 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 301-309-8504 

(B) TELEFAX: 301-309-8439 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1127 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1 . . 1122 



c--> 



# 
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(xi) 1 SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 



ATG AGC TCCE CGA ATC GTC AGG GAG CTC GCC TTA GTC GTC ACC CTT CTC 
Met Ser SerYArg lie Val Arg Glu Leu Ala Leu Val Val Thr Leu Leu 
1 \ 5 10 15 



48 



CAC TTG ACC AGG GTG GGG CTC TCC ACC TGC CCC GCT GAC TGC CAC TGC 
His Leu Thr ArgWal .Gly Leu Ser Thr Cys Pro Ala Asp Cys His Cys 
20 \ 25 30 



96 



CCC CTG 
Pro Leu 



GGC TGC 
Gly Cys 
50 



GAG GCG CCC AAG TGC GCG CCG GGA GTC GGG CTG 
Glu Ala Pro\Lys Cys Ala Pro Gly Val Gly Leu 
35 \ 40 45 

GGC TGT TGT AAG GTC TGC GCC AAG CAG CTC AAC 
Gly Cys Cys Lys Val Cys Ala Lys Gin Leu Ash 
\ 55 60 



GTC CGG GAC 
Val Arg Asp 



GAG GAC TGC 
Glu Asp Cys 



144 



192 



AGA AAA ACG CAG CCC TGC GAC CAC ACC AAG GGG CTG GAA TGC AAC TTC 
Arg Lys Thr Gin Pro Cys Asp His Thr Lys Gly Leu Glu Cys Asn Phe 
65 70 \ 75 80 



240 



GGC GCC AGC TCC ACC GCT CTG AAG GGG ATC TGC AGA GCT CAG TCA GAG 
Gly Ala Ser Ser Thr Ala Leu Lys Gly lie Cys Arg Ala Gin Ser Glu 
85 \ 90 95 



288 



GGC AGA 
Gly Arg 



TTC CAG 
Phe Gin 



GGG GCT 
Gly Ala 
130 

GGC TGT 
Gly Cys 
145 



CCC TGT GAA TAT AAC TCC AGA ATC TAC CAA AAC 
Pro Cys Glu Tyr Asn Ser Arg lie Tyr Gin Asn 
100 io\ 



CCC AAC TGT AAA CAT fAG 
Pro Asn Cys Lys His Gin 
115 120 




ATT GGA 
lie Gly 
125 



TGC ATT CCT CTG TGTV CCC CT R^GAAYCTA TCT CTC 

Cys lie Pro Leu Cys Pro Gin Glu Leu Ser Leu 
135 \ 140 

CCC AAC CCT CGG CTG GTC AAA GTT ACQ GGG CAG 

Pro Asn Pro Arg Leu Val Lys Val Thr\Gly Gin 



150 



155 



GGG GAA AGT 
Gly Glu Ser 
110 

TGG CGC CGG 
Trp Arg Arg 



CCC AAC TTG 
Pro Asn Leu 



TGC TGC GAG 
Cys Cys Glu 
160 



336 



384 



432 



480 



GAG TGG GTC TGT GAC GAG GAT AGT ATC AAG GAC CCC ATG GAG GAC CAG 
Glu Trp Val Cys Asp Glu Asp Ser lie Lys Asp Pro\Met Glu Asp Gin 
165 170 \ 175 



528 



GAC GGC CTC CTT GGC AAG GGG CTG GGA TTC GAT GCC T<SC GAG GTG GAG 
Asp Gly Leu Leu Gly Lys Gly Leu Gly Phe Asp Ala SeV Glu Val Glu 
180 185 \ 190 



576 



TTG ACG AGA AAC AAT GAA TTG ATT GCA GTT GGA AAA GGC 
Leu Thr Arg Asn Asn Glu Leu lie Ala Val Gly Lys Gly 
195 200 205 



«AGC TCA CTG 
Ser Ser Leu 



624 



40 



.4= 

89 
ffl 



•i3=f 



AAG CGO. CTC CCT GTT 
Lys Arg \eu Pro Val 
210 

TTA CAA GGC\:AG AAA 
Leu Gin Gly G\n Lys 
225 



TTT GGA 
Phe Gly 
215 

TGT ATT 
Cys lie 
230 




TGT GGA CAG 
Cys Gly Gin 
275 

AAG ACC AAG 
Lys Thr Lys 
290 

TTG AGT GTG 
Leu Ser Val 
305 

GGC CGA TGC 
Gly Arg Cys 



CCC TGC GAA 
Pro Cys Glu 



TCC TCC AAA 
Ser Ser Lys 
355 




TCA AAG ACC TGT 
Ser Lys Thr Cys 



CCT GAG TGC CGC CTT 
ro Glu Cys Arg Leu 
260 



ACT GGT 
Thr Gly 



GTSG AAA 
Va\ Lys 



ATG GAG 
Met Glu 



GTT CAA 
Val Gin 



ATC TCC 
lie Ser 



GAA ACC 
Glu Thr 
265 



CCA GTG 
Pro Val 



AAA TCC 
Lys Ser 



AAG AAA 
Lys Lys 



TGC ACG 
Cys Thr 
325 

GAT GGG 
Asp Gly 
340 

TGC AAC 
Cys Asn 



TAC AGC AGC CTG 
Tyr SenSer Leu 



N^80 



CCC GAA CCA GTC 
Pro Glu Pro\Val 
295 

TAC CGG 
Tyr Arg Pro 
310 

CCC C 
Pro Gin 



GAG AC A TTT TCC 
Glu Thr Phe Ser 
345 



CCT CGC 
Pro Arg 



ACA ACT 
Thr Thr 
235 

ACA CGA 
Thr Arg 
250 

CGG ATT 
Arg lie 



AAA AAG 
Lys Lys 



AGG TTT 
Arg Phe 



ATC CTA TAC 
lie Leu Tyr 
220 

TCA TGG TCC 
Ser Trp Ser 



GTT ACC AAT 
Val Thr Asn 



TGT GAG GTG 
Cys Glu Val 
270 



AAC CCT 
Asn Pro 



CAG TGC 
Gin Cys 
240 

GAC AAC 
Asp Asn 
255 

CGG CCT 
Arg Pro 




GGC AAG AAA TGC AGC 
Gly Lys Lys Cys Ser 
285 

ACT TAC GCT GGA TGT 
Thr Tyr Ala Gly Cys 
300 

GGT TCC TGC GTG GAC 
Gly Ser Cys Val Asp 
320 

GTG AAG ATG CGG TTC 
Val Lys Met Arg Phe 
335 

GTC ATG ATG ATC CAG 
,Val Met Met lie Gin 
350 



TAC AAC TGC CCG CAT GCC AAG, AAG CAG CGT TTC 
Tyr Asn Cys Pro His Ala LysNLys Gin Arg Phe 
360 



672 



720 



768 



816 



864 



912 



960 



1008 



1056 



1104 



CCT TCT ACA GGC TGT TCC AATGA 
Pro Ser Thr Gly Cys Ser 
370 



1127 



( 2 ) INFORMATION FOR SEQ ID NO : 2 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 374 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: 



protein 



• # 



41 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Ser Ser Arg lie Val Arg Glu Leu Ala Leu Val Val Thr Leu Leu 
1 \ 5 10 15 

His Leu Thr ArA Val Gly Leu Ser Thr Cys Pro Ala Asp Cys His Cys 
20\ 25 30 




Pro Leu Glu Ala Pro Lys Cys Ala Pro Gly Val Gly Leu Val Arg Asp 
35 \ 40 45 

Gly Cys Gly Cys CysYLys Val Cys Ala Lys Gin Leu Asn Glu Asp Cys 
50 \ 55 60 

Arg Lys Thr Gin Pro Cys Asp His Thr Lys Gly Leu Glu Cys Asn Phe 
65 70\ 75 80 



Gly Ala Ser Ser Thr Ala Deu Lys Gly lie Cys Arg Ala Gin Ser Glu 
85 \ 90 95 



Gly Arg Pro Cys Glu Tyr AsnNSer Arg lie Tyr Gin Asn Gly Glu Ser 
100 \ 105 110 



-A 



Phe Gin Pro Asn Cys Lys His Gln\ Cys Thr Cys lie Gly Trp Arg Arg 
115 120\ 125 

\ 

Gly Ala Cys lie Pro Leu Cys Pro^G*rP e Glu Leu Ser Leu Pro Asn Leu 
130 135 f \ J 140 



Gly Cys Pro Asn Pro Arg Leu Val 
145 150 




T/ir Gly/Gln Cys Cys Glu 

160 



Glu Trp Val Cys Asp GiJT*?Sgp==S€£*^^'Lysi Asp Pro Met Glu Asp Gin 
165 ^ ' 170 \ 175 

Asp Gly Leu Leu Gly Lys Gly Leu Gly Phe Asp Ala Ser Glu Val Glu 
180 185 \ 190 

Leu Thr Arg Asn Asn Glu Leu lie Ala Val Gly Lys Gly Ser Ser Leu 
195 200 \ 205 



Lys Arg Leu Pro Val Phe Gly Met Glu Pro Arg lie Leu Tyr Asn Pro 
210 215 22o\ 



Leu Gin Gly Gin Lys Cys lie Val Gin Thr Thr Ser Trp Ser Gin Cys 
225 230 235 \ 240 



Ser Lys Thr Cys Gly Thr Gly lie Ser Thr Arg Val Thr« Asn Asp Asn 
245 250 \ 255 



Pro Glu Cys Arg Leu Val Lys Glu Thr Arg lie Cys Glu Val Arg Pro 
260 265 27 V C 



Cys Gly Gin Pro Val Tyr Ser Ser Leu Lys Lys Gly Lys Lys Cys Ser 
275 280 285 



42 



Lys Thr Lys\Lys Ser Pro Glu Pro Val Arg Phe Thr Tyr Ala Gly Cys 
290 \ 295 300 

Leu Ser Val Lyk Lys Tyr Arg Pro Lys Tyr Cys Gly Ser Cys' Val Asp 
305 \ 310 315 320 

Gly Arg Cys Cys Tkr Pro Gin Leu Thr Arg Thr Val Lys Met Arg Phe 
32\ 330 335 

Pro Cys Glu Asp Gly Glu Thr Phe Ser Lys Asn Val Met Met lie Gin 
340 \ 345 350 



er Ser Lys Cys Asn Tyr ftsn Cys Pro His Ala Lys Lys Gin Arg Phe 
355 \ 360 365 




Pro Ser Thr Gly Cys Ser 
370 



(2) INFORMATION FOR SEQ ID NO :^: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 372 amino a\ids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION 



Met Ser Ser Ser Thr Phe Arg Thr Leu Ala Val Ala Val Thr Leu Leu 
1 5 10 \ 15 



His Leu Thr Arg Leu Ala Leu Ser Thr Cys Pro\ Ala Ala Cys His Cys 
20 25 \ 30 

Pro Leu Glu Ala Pro Lys Cys Ala Pro Gly Val Gly Leu Val Arg Asp 
35 40 \ 45 



Gly Cys Gly Cys Cys Lys Val Cys Ala Lys Gin Leu *A.sn Glu Asp Cys 
50 55 60 



\ 



Ser Lys Thr Gin Pro Cys Asp His Thr Lys Gly Leu Glu Cys Asn Phe 
65 70 75 \ 80 



V 



Gly Ala Ser Ser Thr Ala Leu Lys Gly lie Cys Arg Ala Gin Ser Glu 
85 90 \ 95 



Gly Arg Pro Cys Glu Tyr Asn Ser Arg lie Tyr Gin Asn Gly v Glu Ser 
100 105 



no\ 



43 

Phe Gin pVo Asn Cys Lys His Gin Cys Thr Cys lie Asp Gly Ala Val 

1VS 120 125 

Gly Cys lie }?ro Leu Cys Pro Gin Glu Leu Ser Leu Pro Asn Leu Gly 

130 \ 135 140 

Cys Pro Asn Pro\Arg Leu Val Lys Val Ser Gly Gin Cys Cys Glu Glu 

145 \ 150 155 160 

Trp Val Cys Asp Gl\i Asp Ser lie Lys Asp Ser Leu Asp Asp Gin Asp 

165\ 170 175 



Asp Leu Leu Gly Leu Asp Ala Ser Glu Val Glu Leu Thr Arg Asn Asn 
180 \ 185 190 



Glu Leu lie Ala lie Gly Lys Gly Ser Ser Leu Lys Arg Leu Pro Val 
195 \ 200 205 



\ 



Phe Gly Thr Glu Pro Arg Val \Leu Phe Asn Pro Leu His Ala His Gly 
210 215 \ 220 



Gin Lys Cys lie Val Gin Thr Thr\Ser Trp Ser Gin Cys Ser Lys Ser 

225 230 \ 235 240 

Cys Gly Thr Gly lie Ser Thr Arg VaSl Thr Asn Asp Asn Pro Glu Cys 
245 \2_5O 255 

Arg Leu Val Lys Glu Thr Arg lie Cys Giu A/al Arg Pro Cys Gly Gin 

260 265\ \/" v / 270 

Pro Val Tyr Ser Ser Leu Lys Lys Gly VLys\Lys^5ys Ser Lys Thr Lys 

275 2 < ^^^ Esa ^\ 285 



Lys Ser Pro Glu Pro Val Arg Phe Thr Tyr Ala^Gly Cys Ser Ser Val 
290 295 \300 



Lys Lys Tyr Arg Pro Lys Tyr Cys Gly Ser Cys Val Asp Gly Arg Cys 
305 310 315 \ 320 



Cys Thr Pro Leu Gin Thr Arg Thr Val Lys Met Arg Phe Arg Cys Glu 
325 330 \ 335 

Asp Gly Glu Met Phe Ser Lys Asn Val Met Met lie Gln^Ser Cys Lys 
340 345 "350 

Cys Asn Tyr Asn Cys Pro His Pro Asn Glu Ala Ser Phe Arg Leu Tyr 
355 360 365 

Ser Leu Phe Asn 
370 

INFORMATION FOR SEQ ID NO: 4: 




(i) SEQUENCE CHARACTERISTICS: 

■ (A) LENGTH: 345 amino acids 
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(B) T*PE: amino acid 

(C) StI^USIDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

Met Leu Ala Ser Val\Ala Gly Pro lie Ser Leu Ala Leu Val Leu Leu 
1 5 \ 10 15 

Ala Leu Cys Thr Arg PAo Ala Thr Gly Gin Asp Cys Ser Ala Gin Cys 
20 \ 25 30 

Gin Cys Ala Ala Glu Ala ^la Pro His Cys Pro Ala Gly Val Ser Leu 
35 \ 40 45 

Val Leu Asp Gly Cys Gly Cyfa Cys Arg Val Cys Ala Lys Gin Leu Gly 
50 55\ 60 

Glu Leu Cys Thr Glu Arg Asp Pro_ Cys Asp Pro His Lys Gly Leu Phe 
65 70 \ 75 80 



Cys Asp Phe Gly Ser Pro Ala Asn Arg Lys lie Gly Val Cys Thr Ala 
85 \ 90 95 



Lys Asp Gly Ala Pro Cys Val 
100 

Glu Ser Phe Gin Ser Ser Cys Lyi 
115 120 

Ala Val Gly Cys Val 

130 135 




ly Ser Val Tyr Arg Ser Gly 
5/ 110 

Cys Leu Asp Gly 
125 

f&r 'Met Asp Val Arg Leu Pro Ser 
140 



Pro Asp Cys Pro Phe Pro Arg Arg Val Lys Leu Pro Gly Lys Cys Cys 
145 150 \ 155 160 



Lys Glu Trp Val Cys Asp Glu Pro Lys Asp Vrg Thr Ala Val Gly Pro 
165 170 \ 175 

Ala Leu Ala Ala Tyr Arg Leu Glu Asp Thr Phe Gly Pro Asp Pro Thr 
180 185 \ 190 

Met Met Arg Ala Asn Cys Leu Val Gin Thr Thr \31u Trp Ser Ala Cys 
195 200 \ 205 



Ser Lys Thr Cys Cys Met Gly lie Ser Thr Arg Vau Thr Asn Asp Asn 

220 



210 



215 



Thr Phe Cys Arg Leu Glu Lys Gin Ser Arg Leu Cys 'Met Val Arg Pro 
225 230 235 ^ 240 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 
CGCGGGTACC AGGTAGCATT TAGTCCCTAA 



30 



* 



46 



L 8 

M 

-eg 



(2) INFORMATIONXFOR SEQ ID NO : 7 : 



( i ) SEQUENCFACHARACTERISTICS : 




(A) LENGTH: 27 base pairs 

(B) TYPE-V nucleic acid 

(C) STRANHEDNESS : single 

(D) TOPOLOGY : linear 



(ii) MOLECULE TYPEV DNA (genomic) 




(xi) SEQUENCE DESCRIPTION : SEQ ID NO : 7 : 

\ 

AAAGGATCCA CAATGAGCTC CCGAATC 

\ 

(2) INFORMATION FOR SEQ ID NOt: 8 : 

(i) SEQUENCE CHARACTE 

(A) LENGTH: 58 has 

(B) TYPE: nucleic a) 

(C) STRANDEDN 

(D) TOPOLOGY: 

(ii) MOLECULE TYPE: DNA (genomife) 




27 



(xi) SEQUENCE DESCRIPTION: SEQ ID NC^: 8 : 
CGCTCTAGAT TAAGCGTAGT CTGGGACGTC GTATGGGTAT TGGAACAGCC TGTAGAAG 



58 



